BLAST



Basic Local Alignment Search Tool 



• Your search parameters were adjusted to search for a short input sequence. 
Protein Sequence (8 letters) Residues 73-80 of seq id no: 12
Results for: lcl|27776 None(8aa)

Your BLAST job specified more than one input sequence. This box lets you choose which input sequence 
to show BLAST results for. 

Query ID        lcl|27776
Description     None
Molecule type   amino acid
Query Length    8

Database Name   nr
Description     All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental
                samples from WGS projects
Program         BLASTP 2.2.22+

Citation

Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb
Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein
database search programs", Nucleic Acids Res. 25:3389-3402.



Search Parameters 


Program         blastp
Word size       2
Expect value    200000
Hitlist size    100
Gapcosts        9,1
Matrix          PAM30
Threshold       11
Filter string   F
Genetic Code    1
Window Size     40



Database 

Posted date           Jan 3, 2010 5:44 PM
Number of letters     3,505,793,397
Number of sequences   10,274,250
Entrez query          none

Karlin-Altschul statistics

Params Ungapped Gapped 

Lambda 0.368907 0.294 

K 0.286243 0.11 

H 1.92795 0.61 



Results Statistics 

Length adjustment 0 
Effective length of query 8 
Effective length of database 3505793397 
Effective search space 28046347176
Effective search space used 28046347176
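
The Karlin-Altschul parameters and the effective search space above determine every bit score and Expect value reported in the alignments. The following is a minimal sketch, assuming the standard Karlin-Altschul relations S' = (lambda*S - ln K) / ln 2 and E = K*m*n*exp(-lambda*S), with the gapped lambda and K taken from this report. The function names are illustrative and are not part of BLAST. Because the report prints lambda and K rounded, recomputed values can differ from the printed ones in the last digit.

```python
import math

# Gapped Karlin-Altschul parameters as printed in this report (rounded).
LAMBDA_GAPPED = 0.294
K_GAPPED = 0.11

# Effective lengths from the "Results Statistics" block.
QUERY_LEN = 8
DB_LEN = 3_505_793_397
SEARCH_SPACE = QUERY_LEN * DB_LEN  # effective search space


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE,
                 lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Expected number of chance alignments scoring at least raw_score."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    # Raw scores that appear in parentheses on the "Score =" lines.
    for raw in (52, 51, 44):
        print(f"raw={raw}  bits={bit_score(raw):.1f}  E={expect_value(raw):.0f}")
```

For a raw score of 52 this gives roughly 25.2 bits and an Expect value near 708, in line with the first alignment in the report. Expect values this large mean that an exact 7-residue match is expected by chance hundreds of times in a database of this size.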



Show Conserved Domains 

No putative conserved domains have been detected 



Distribution of 101 Blast Hits on the Query Sequence 



An overview of the database sequences aligned to the query sequence is shown. The score of each
alignment is indicated by one of five different colors, which divide the range of scores into five groups.
Multiple alignments on the same database sequence are connected by a striped line. Mousing over a hit
sequence causes the definition and score to be shown in the window at the top; clicking on a hit
sequence takes the user to the associated alignments.

[Graphic not reproduced: color key for alignment scores]




http://blast.ncbi.nlm.nih.gov/Blast.cgi 



1/4/2010 



NCBI Blast: Protein Sequence (8 letters) 



Page 3 







Sequences producing significant alignments: 



Score (Bits)  E Value

urinary plasminogen activator [Rattus norvegi...
plasminogen activator, urokinase [Rattus nor...
RecName: Full=Urokinase-type plasminogen...
plasminogen activator, urokinase [Mus muscul...
middle S protein [Hepatitis B virus] >gb|ACX36...
middle S protein [Hepatitis B virus]
S protein [Hepatitis B virus]
surface antigen [Hepatitis B virus]
S protein [Hepatitis B virus]
hypothetical protein [Paramecium tetraure...
HBsAg [Hepatitis B virus]
HBsAg [Hepatitis B virus]
51C surface protein [Paramecium tetraurelia]
HBsAg [Hepatitis B virus]
surface antigen [Hepatitis B virus]
truncated S protein [Hepatitis B virus]
truncated large S protein [Hepatitis B virus]
hypothetical protein [Monosiga brevicolli...
GE11868 [Drosophila yakuba] >gb|EDW91787...
hypothetical protein PPL_10456 [Polysphondyliu...
Cof-like hydrolase [Dickeya zeae Ech1591]...
BNR/Asp-box repeat domain-containing protein,...
sortilin, putative [Toxoplasma gondii ME4...
hypothetical protein LOC100277631 [Zea ma...
hypothetical protein [Monosiga brevicolli...
predicted protein [Physcomitrella patens...
FAD dependent oxidoreductase [Pseudomonas...
predicted protein [Physcomitrella patens...
hypothetical protein DDB_G0283853 [Dictyoste...
hypothetical protein DDB_G0283869 [Dictyoste...
hypothetical protein DDB_G0283787 [Dictyoste...
hypothetical protein PANDA_005925 [Ailuropoda...
hypothetical protein PPL_05237 [Polysphondyliu...
carbohydrate-binding domain-containing protein...
pectinesterase [Ruminococcus flavefaciens...
periodic tryptophan protein 2, putative; pred...
hypothetical protein SORBIDRAFT_07g026515...
transcription initiation factor brf1 [Sch...
lipoprotein [Escherichia coli] >gb|ACL520...
cytochrome C family protein [Persephonell...
hypothetical protein TTHERM_01486710 [Tet...
DNA topoisomerase (ATP-hydrolyzing) [Opitu...
hypothetical protein OsJ_04192 [Oryza sativa J...
predicted protein [Thalassiosira pseudona...
putative lipoprotein [Escherichia coli] >...
hypothetical protein IscW_ISCW011495 [Ixo...
hypothetical protein BRAFLDRAFT_123314 [B...
hypothetical protein BRAFLDRAFT_86993 [Br...
hypothetical protein BRAFLDRAFT_89993 [Br...   21.8   7434
PREDICTED: similar to integrin beta Hrl [...   21.8   7434
unnamed protein product [Homo sapiens]   21.8   7434
LOC100158602 protein [Xenopus (Silurana) tropi...   21.8   7434
unnamed protein product [Podospora anseri...   21.8   7434
unnamed protein product [Podospora anseri...   21.8   7434
predicted protein [Physcomitrella patens ...   21.8   7434
p300/CBP acetyltransferase-related protei...   21.8   7434
p300/CBP acetyltransferase-related protei...   21.8   7434
hypothetical protein PEPMIC_01220 [Parvimo...   21.8   7434
hypothetical protein NCU03244 [Neurospora cr...   21.8   7434
PREDICTED: fibrillin 1 [Equus caballus]   21.8   7434
hypothetical protein MDG893_13589 [Marinob...   21.8   7434
fibrillin 1, isoform CRA_a [Rattus norvegicus]...   21.8   7434
fibrillin 1, isoform CRA_b [Rattus norvegicus]   21.8   7434
fibrillin 1, isoform CRA_a [Mus musculus] >gb|...   21.8   7434
hypothetical protein An08g10840 [Aspergil...   21.8   7434
PREDICTED: hypothetical protein [Monodelp...   21.8   7434
hypothetical protein OsI_04550 [Oryza sativa I...   21.8   7434
hypothetical protein [Paramecium tetraure...   21.8   7434
conserved protein [Microscilla marina ATCC...   21.8   7434
fibrillin 1 (Marfan syndrome), isoform CRA_a [...   21.8   7434
fibrillin 1 precursor [Homo sapiens] >sp|P35...   21.8   7434
fibrillin 1 [Mus musculus] >emb|CAM22806.1|...   21.8   7434
drug/metabolite exporter family transporter...   21.8   7434
PREDICTED: fibrillin 1 [Pan troglodytes]   21.8   7434
Os01g0865600 [Oryza sativa (japonica cult...   21.8   7434
PREDICTED: similar to family with sequence s...   21.8   7434
PREDICTED: similar to CG16791-PA [Apis melli...   21.8   7434
PREDICTED: estrogen-related receptor beta...   21.8   7434
PREDICTED: similar to fibrillin 1 precurs...   21.8   7434
hypothetical protein RED65_16411 [Oceanoba...   21.8   7434
PREDICTED: similar to Fibrillin-1 precursor ...   21.8   7434
PREDICTED: similar to Fibrillin-1 precursor ...   21.8   7434
fibrillin 1 variant [Homo sapiens]   21.8   7434
bHLH transcription factor-like protein [Oryza...   21.8   7434
PilL [Escherichia coli]   21.8   7434
lipoprotein [Escherichia coli] >gb|AAL18827...   21.8   7434
PilL [Escherichia coli]   21.8   7434
novel protein similar to MHC class II beta ch...   21.8   7434
hypothetical protein TTHERM_01006510 [Tet...   21.8   7434
mutant fibrillin-1 [Mus musculus]   21.8   7434
fibrillin-1 [Mus musculus]   21.8   7434
PREDICTED: similar to Fibrillin-1 precursor ...   21.8   7434
PREDICTED: similar to Fibrillin-1 precursor ...   21.8   7434
PREDICTED: similar to Fibrillin-1 precursor ...   21.8   7434
fibrillin 1 [Rattus norvegicus] >gb|...   21.8   7434
fibrillin 1 [Homo sapiens]   21.8   7434
C. briggsae CBR-LAM-1 protein [Caenorhabd...   21.8   7434
fibrillin 1 precursor [Sus scrofa] >sp|Q9...   21.8   7434
fibrillin ...   21.8   7434
RecName: Full=Fibrillin-1; Flags: Prec...   21.8   7434



>emb|CAA46601.1| urinary plasminogen activator [Rattus norvegicus]
Length=432

GENE ID: 25619 Plau | plasminogen activator, urokinase [Rattus norvegicus]
(Over 10 PubMed links) 

Score = 25.2 bits (52), Expect = 708 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTSTGQC 
Sbjct 283 RTSTGQC 289 



>ref|NP_037217.3| plasminogen activator, urokinase [Rattus norvegicus]
gb|AAI05860.1| Plasminogen activator, urokinase [Rattus norvegicus]
gb|EDL86256.1| rCG41849 [Rattus norvegicus]
Length=432

GENE ID: 25619 Plau | plasminogen activator, urokinase [Rattus norvegicus]
(Over 10 PubMed links) 

Score = 25.2 bits (52), Expect = 708 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTSTGQC 
Sbjct 283 RTSTGQC 289 



>sp|P29598.1|UROK_RAT RecName: Full=Urokinase-type plasminogen activator; Shor
Short=U-plasminogen activator; Contains: RecName: Full=Urokinase-type
plasminogen activator long chain A; Contains: RecName:
Full=Urokinase-type plasminogen activator short chain
A; Contains: RecName: Full=Urokinase-type plasminogen activator
chain B; Flags: Precursor
emb|CAA45028.1| urokinase-type plasminogen activator [Rattus norvegicus]
Length=432

GENE ID: 25619 Plau | plasminogen activator, urokinase [Rattus norvegicus]
(Over 10 PubMed links) 

Score = 25.2 bits (52), Expect = 708 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTSTGQC 
Sbjct 283 RTSTGQC 289 



>ref|NP_032899.1| plasminogen activator, urokinase [Mus musculus]
sp|P06869.1|UROK_MOUSE RecName: Full=Urokinase-type plasminogen activator; Sh
activator; Short=uPA; Contains: RecName: Full=Urokinase-type
plasminogen activator long chain A; Contains: RecName:
Full=Urokinase-type plasminogen activator short chain
A; Contains: RecName: Full=Urokinase-type plasminogen activator
chain B; Flags: Precursor
emb|CAA26231.1| unnamed protein product [Mus musculus]
gb|AAA40539.1| urokinase-type plasminogen activator [Mus musculus]
gb|AAI20710.1| Plasminogen activator, urokinase [Mus musculus]
gb|AAI20714.1| Plasminogen activator, urokinase [Mus musculus]
gb|EDL01484.1| plasminogen activator, urokinase [Mus musculus]
dbj|BAH02680.1| urokinase-type plasminogen activator [Mammalian expression vect
pmAlbEPintPlauGH]
Length=433



Score = 25.2 bits (52), Expect = 708 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 



Query 1 RTSTGQC 7 

RTSTGQC 
Sbjct 284 RTSTGQC 290 

>gb|ACX36965.1| middle S protein [Hepatitis B virus] 
gb|ACX36975.1| middle S protein [Hepatitis B virus] 
Length=243 

Score = 24.8 bits (51), Expect = 949

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%)

Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 133 TSTGQCK 139 

>gb|ACX36957.1| middle S protein [Hepatitis B virus]
Length=243 

Score = 24.8 bits (51), Expect = 949 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 133 TSTGQCK 139 

>gb|ABU97123.1| S protein [Hepatitis B virus]
Length=57 

Score = 24.8 bits (51), Expect = 949 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 9 TSTGQCK 15 

>gb|ABI17005.1| surface antigen [Hepatitis B virus] 
Length=57 

Score = 24.8 bits (51), Expect = 949 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 9 TSTGQCK 15 

>dbj|BAG12008.1| S protein [Hepatitis B virus]
Length=254 



Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 144 TSTGQCK 150 

>ref|XP_001450163.1| hypothetical protein [Paramecium tetraurelia strain d4-
emb|CAK82766.1| unnamed protein product [Paramecium tetraurelia]
Length=2233

GENE ID: 5035948 GSPATT00017138001 | hypothetical protein
[Paramecium tetraurelia strain d4-2]

Score = 24.8 bits (51), Expect = 949 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 1936 TSTGQCK 1942 



>gb|AAX59509.1| HBsAg [Hepatitis B virus] 
Length=101 

Score = 24.8 bits (51), Expect = 949 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 5 TSTGQCK 11 



>gb|AAX59315.1| HBsAg [Hepatitis B virus] 
Length=101 

Score = 24.8 bits (51), Expect = 949 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 5 TSTGQCK 11 



>gb|AAA61740.1| 51C surface protein [Paramecium tetraurelia]
Length=2233 

Score = 24.8 bits (51), Expect = 949 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 1936 TSTGQCK 1942 



>gb|AAW65562.1| HBsAg [Hepatitis B virus]
Length=226 

Score = 24.8 bits (51), Expect = 949 

Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTGQCK 
Sbjct 116 TSTGQCK 122 



>gb|AAD13661.1| surface antigen [Hepatitis B virus]
Length=226 

Score = 23.5 bits (48), Expect = 2294 

Identities = 7/8 (87%), Positives = 7/8 (87%), Gaps = 0/8 (0%) 

Query 1 RTSTGQCK 8 

RTSTG CK 
Sbjct 115 RTSTGPCK 122 



>gb|ACB45647.1| truncated S protein [Hepatitis B virus]
Length=171 

Score = 23.1 bits (47), Expect = 3077 

Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 3/11 (27%) 

Query 1 RTST---GQCK 8

RTST   GQCK
Sbjct 112 RTSTTGTGQCK 122
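
The Identities and Gaps figures on each alignment can be recomputed from the aligned Query and Sbjct rows. The sketch below assumes gap positions are written as "-" in the aligned rows and that percentages are truncated rather than rounded, which is consistent with values such as 6/7 (85%) elsewhere in this report. The helper names are illustrative only.

```python
def alignment_stats(query_row, sbjct_row):
    """Return (identities, gaps, alignment_length) for two aligned rows."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(query_row, sbjct_row))
    # A column is an identity when both residues match and neither is a gap.
    identities = sum(1 for q, s in pairs if q == s and q != "-")
    # A column is a gap when either row holds the gap character.
    gaps = sum(1 for q, s in pairs if q == "-" or s == "-")
    return identities, gaps, len(pairs)


def truncated_percent(count, total):
    """Integer percentage, truncated toward zero (assumed report convention)."""
    return count * 100 // total


if __name__ == "__main__":
    ident, gaps, length = alignment_stats("RTST---GQCK", "RTSTTGTGQCK")
    print(f"Identities = {ident}/{length} ({truncated_percent(ident, length)}%), "
          f"Gaps = {gaps}/{length} ({truncated_percent(gaps, length)}%)")
```

Positives additionally count conservative substitutions (the "+" positions on the middle line), which requires the PAM30 matrix and is not reproduced here.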



>gb|ACB45646.1| truncated large S protein [Hepatitis B virus]
Length=345

Score = 23.1 bits (47), Expect = 3077

Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 3/11 (27%)

Query 1 RTST---GQCK 8

RTST   GQCK
Sbjct 286 RTSTTGTGQCK 296

>ref|XP_001750707.1| hypothetical protein [Monosiga brevicollis MX1]
gb|EDQ84520.1| predicted protein [Monosiga brevicollis MX1]
Length=1216

GENE ID: 5895935 MONBRDRAFT_30197 | hypothetical protein
[Monosiga brevicollis MX1]

Score = 23.1 bits (47), Expect = 3077 

Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 3/11 (27%) 

Query 1 RTSTGQ---CK 8

RTSTGQ   CK
Sbjct 451 RTSTGQVIACK 461
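
The raw score of 47 on the gapped alignments can be checked against the ungapped hits using only numbers printed in this report. The sketch below assumes that "Gapcosts 9,1" means an existence cost of 9 and an extension cost of 1 per gapped position, so a gap of length k costs 9 + k. The variable names are illustrative.

```python
GAP_EXISTENCE = 9   # first value of "Gapcosts 9,1"
GAP_EXTENSION = 1   # second value of "Gapcosts 9,1"


def gap_cost(length):
    """Affine cost of a single gap spanning `length` positions."""
    return GAP_EXISTENCE + GAP_EXTENSION * length


# Raw scores printed in parentheses for exact matches in this report.
SCORE_RTSTGQC = 52   # query residues 1-7
SCORE_TSTGQCK = 51   # query residues 2-8
SCORE_TSTGQC = 44    # query residues 2-7

# Score contributed by the final K aligned with K.
score_k = SCORE_TSTGQCK - SCORE_TSTGQC

# All eight query residues matched exactly, before any gap penalty.
full_identity_score = SCORE_RTSTGQC + score_k

# One gap of three positions, as in RTST---GQCK and RTSTGQ---CK.
gapped_score = full_identity_score - gap_cost(3)

if __name__ == "__main__":
    print(f"K:K score = {score_k}")
    print(f"full 8-residue identity score = {full_identity_score}")
    print(f"score with one 3-position gap = {gapped_score}")
```

The result, 47, matches the raw score printed for the alignments that report Gaps = 3/11.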

>ref|XP_002092075.1| GE11868 [Drosophila yakuba]
gb|EDW91787.1| GE11868 [Drosophila yakuba]
Length=817

GENE ID: 6531266 Dyak\GE11868 | GE11868 gene product from transcript GE11868-RA
[Drosophila yakuba] (10 or fewer PubMed links)

Score = 22.7 bits (46), Expect = 4129 

Identities = 7/8 (87%), Positives = 7/8 (87%), Gaps = 0/8 (0%) 

Query 1 RTSTGQCK 8 

RTSTGQ K 
Sbjct 389 RTSTGQSK 396 

>gb|EFA75884.1| hypothetical protein PPL_10456 [Polysphondylium pallidum PN500]
Length=641

Score = 22.3 bits (45), Expect = 5541

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

T+TGQCK 
Sbjct 242 TATGQCK 248 

Score = 19.7 bits (39), Expect = 32333 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TS GQCK 
Sbjct 158 TSDGQCK 164 

>ref|YP_003006388.1| Cof-like hydrolase [Dickeya zeae Ech1591]
gb|ACT08909.1| Cof-like hydrolase [Dickeya zeae Ech1591]
Length=279

GENE ID: 8120965 Dd1591_4113 | Cof-like hydrolase [Dickeya zeae Ech1591]
Score = 22.3 bits (45), Expect = 5541 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTSTG+C 
Sbjct 84 RTSTGEC 90 

>gb|EEE22595.1| BNR/Asp-box repeat domain-containing protein, putative [Toxoplas
gondii GT1]
gb|EEE31368.1| BNR/Asp-box repeat domain-containing protein, putative [Toxoplas
gondii VEG]

Score = 22.3 bits (45), Expect = 5541

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTSTG+C 

Sbjct 374 RTSTGEC 380 

>ref|XP_002368431.1| sortilin, putative [Toxoplasma gondii ME49]
gb|EEB01291.1| sortilin, putative [Toxoplasma gondii ME49]
Length=962

GENE ID: 7897795 TGME49_090160 | sortilin, putative [Toxoplasma gondii ME49]
Score = 22.3 bits (45), Expect = 5541 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTSTG+C 

Sbjct 374 RTSTGEC 380 

>ref|NP_001144615.1| hypothetical protein LOC100277631 [Zea mays]
gb|ACG41806.1| hypothetical protein [Zea mays]
Length=496

Score = 22.3 bits (45), Expect = 5541

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTSTG+C 
Sbjct 189 RTSTGEC 195 

>ref|XP_001742225.1| hypothetical protein [Monosiga brevicollis MX1]
gb|EDQ92463.1| predicted protein [Monosiga brevicollis MX1]
Length=1201

GENE ID: 5887979 MONBRDRAFT_22150 | hypothetical protein
[Monosiga brevicollis MX1]

Score = 22.3 bits (45), Expect = 5541 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTSTG+C 
Sbjct 41 RTSTGEC 47 

>ref|XP_001764053.1| predicted protein [Physcomitrella patens subsp. patens]
gb|EDQ71192.1| predicted protein [Physcomitrella patens subsp. patens]
Length=374

GENE ID: 5927198 PHYPADRAFT_77455 | hypothetical protein
[Physcomitrella patens subsp. patens] (10 or fewer PubMed links)



Query 1 RTSTGQC 7 

RTS+GQC 
Sbjct 189 RTSSGQC 195 

>ref|YP_001669438.1| FAD dependent oxidoreductase [Pseudomonas putida GB-1]
gb|ABY99102.1| FAD dependent oxidoreductase [Pseudomonas putida GB-1]

GENE ID: 5870995 PputGB1_3210 | FAD dependent oxidoreductase
[Pseudomonas putida GB-1]

Score = 22.3 bits (45), Expect = 5541 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTS+GQC 
Sbjct 186 RTSSGQC 192 



>ref|XP_001756901.1| predicted protein [Physcomitrella patens subsp. patens]
dbj|BAB39467.1| putative alpha-glucosidase [Physcomitrella patens subsp. pate
gb|EDQ78132.1| predicted protein [Physcomitrella patens subsp. patens]
Length=916

GENE ID: 5920253 PHYPADRAFT_205099 | hypothetical protein
[Physcomitrella patens subsp. patens] (10 or fewer PubMed links)

Score = 22.3 bits (45), Expect = 5541 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTSTG+C 
Sbjct 146 RTSTGEC 152 



>ref|XP_638862.1| hypothetical protein DDB_G0283853 [Dictyostelium discoideum
gb|EAL65541.1| hypothetical protein DDB_G0283853 [Dictyostelium discoideum AX
Length=628

GENE ID: 8624260 DDB_G0283853 | hypothetical protein
[Dictyostelium discoideum AX4] (10 or fewer PubMed links)

Score = 22.3 bits (45), Expect = 5541 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

T+TGQCK 
Sbjct 287 TNTGQCK 293 



>ref|XP_638860.1| hypothetical protein DDB_G0283869 [Dictyostelium discoideum
gb|EAL65549.1| hypothetical protein DDB_G0283869 [Dictyostelium discoideum AX
Length=1141

GENE ID: 8624258 DDB_G0283869 | hypothetical protein
[Dictyostelium discoideum AX4] (10 or fewer PubMed links)

Score = 22.3 bits (45), Expect = 5541 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

T+TGQCK 
Sbjct 798 TNTGQCK 804 



>ref|XP_638863.1| hypothetical protein DDB_G0283787 [Dictyostelium discoideu
gb|EAL65508.1| hypothetical protein DDB_G0283787 [Dictyostelium discoideum AX
Length=1064

GENE ID: 8624261 DDB_G0283787 | hypothetical protein
[Dictyostelium discoideum AX4] (10 or fewer PubMed links)

Score = 22.3 bits (45), Expect = 5541 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

T+TGQCK 
Sbjct 728 TNTGQCK 734 



>gb|EFB29915.1| hypothetical protein PANDA_005925 [Ailuropoda melanoleuca]
Length=2869

Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 



>gb|EFA82002.1| hypothetical protein PPL_05237 [Polysphondylium pallidum PN500]
Length=1715 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTG+CK 
Sbjct 990 TSTGECK 996 



>gb|EFA82001.1| carbohydrate-binding domain-containing protein [Polysphondylium
pallidum PN500]
Length=705

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTG+CK 
Sbjct 527 TSTGECK 533 



>ref|ZP_06141814.1| pectinesterase [Ruminococcus flavefaciens FD-1]
Length=374 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RT TGQC 
Sbjct 27 RTGTGQC 33 



>emb|CBH16504.1| periodic tryptophan protein 2, putative; predicted WD40 repeat
protein [Trypanosoma brucei gambiense DAL972]
Length=939

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

R STGQC 
Sbjct 432 RASTGQC 438 



>ref|XP_002444714.1| hypothetical protein SORBIDRAFT_07g026515 [Sorghum bicolo
gb|EES14209.1| hypothetical protein SORBIDRAFT_07g026515 [Sorghum bicolor]
Length=394

GENE ID: 8060966 SORBIDRAFT_07g026515 | hypothetical protein [Sorghum bicolor]
(10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 150 TSTGQC 155 



>ref|XP_002578344.1| transcription initiation factor brf1 [Schistosoma manso
emb|CAZ34582.1| transcription initiation factor brf1, putative [Schistosoma m
Length=770

GENE ID: 8348817 Smp_071570 | transcription initiation factor brf1
[Schistosoma mansoni] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 43 TSTGQC 48 



>ref|YP_002756665.1| lipoprotein [Escherichia coli]
gb|ACL52052.1| lipoprotein [Escherichia coli]
Length=356

GENE ID: 7701488 p026VIR_pll4 | lipoprotein [Escherichia coli]
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTST QC 
Sbjct 340 RTSTAQC 346 



>ref|YP_002731228.1| cytochrome C family protein [Persephonella marina EX-H1]
gb|ACO04288.1| cytochrome C family protein [Persephonella marina EX-H1]
Length=326

GENE ID: 7675008 PERMA_1460 | cytochrome C family protein
[Persephonella marina EX-H1] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 226 TSTGQC 231 



>ref|XP_001013897.2| hypothetical protein TTHERM_01486710 [Tetrahymena thermop
gb|EAR93652.2| hypothetical protein TTHERM_01486710 [Tetrahymena thermophila
SB210]
Length=2346

GENE ID: 7823083 TTHERM_01486710 | hypothetical protein
[Tetrahymena thermophila]

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 769 TSTGQC 774 



>ref|ZP_03726320.1| DNA topoisomerase (ATP-hydrolyzing) [Opitutaceae bacterium T
gb|EEG19652.1| DNA topoisomerase (ATP-hydrolyzing) [Opitutaceae bacterium TAV2]
Length=788 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 493 TSTGQC 498 



>gb|EEE55720.1| hypothetical protein OsJ_04192 [Oryza sativa Japonica Group]
Length=904 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 676 STGQCK 681



>ref|XP_002290185.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gb|EED91937.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length=2183

GENE ID: 7449574 THAPSDRAFT_22658 | hypothetical protein
[Thalassiosira pseudonana CCMP1335] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1235 TSTGQC 1240 



>ref|YP_002418785.1| putative lipoprotein [Escherichia coli]
emb|CAQ87359.1| putative lipoprotein [Escherichia coli]
Length=355

GENE ID: 7872117 pilL | putative lipoprotein [Escherichia coli ED1a]
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTST QC 
Sbjct 339 RTSTAQC 345 



>ref|XP_002412137.1| hypothetical protein IscW_ISCW011495 [Ixodes scapularis]
gb|EEC14721.1| hypothetical protein IscW_ISCW011495 [Ixodes scapularis]
Length=103

GENE ID: 8036937 IscW_ISCW011495 | hypothetical protein [Ixodes scapularis]

Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 37 STGQCK 42 



>ref|XP_002598165.1| hypothetical protein BRAFLDRAFT_123314 [Branchiostoma flo
gb|EEN54177.1| hypothetical protein BRAFLDRAFT_123314 [Branchiostoma floridae
Length=700

GENE ID: 7231595 BRAFLDRAFT_123314 | hypothetical protein
[Branchiostoma floridae] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 608 STGQCK 613 



>ref|XP_002612022.1| hypothetical protein BRAFLDRAFT_86993 [Branchiostoma fl
gb|EEN68031.1| hypothetical protein BRAFLDRAFT_86993 [Branchiostoma floridae]
Length=1268

GENE ID: 7243150 BRAFLDRAFT_86993 | hypothetical protein
[Branchiostoma floridae] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 270 TSTGQC 275



>ref|XP_002613108.1| hypothetical protein BRAFLDRAFT_89993 [Branchiostoma fl
gb|EEN69117.1| hypothetical protein BRAFLDRAFT_89993 [Branchiostoma floridae]
Length=219

GENE ID: 7209147 BRAFLDRAFT_89993 | hypothetical protein
[Branchiostoma floridae] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/8 (75%), Positives = 7/8 (87%), Gaps = 0/8 (0%) 

Query 1 RTSTGQCK 8 

RT +GQCK 
Sbjct 36 RTGSGQCK 43 



>ref|XP_002124394.1| PREDICTED: similar to integrin beta Hrl [Ciona intestinal
Length=840

GENE ID: 100180214 LOC100180214 | similar to integrin beta Hrl
[Ciona intestinalis]

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 731 STGQCK 736 



>dbj|BAG65498.1| unnamed protein product [Homo sapiens]
Length=1149

GENE ID: 2200 FBN1 | fibrillin 1 [Homo sapiens] (Over 100 PubMed links)

Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 732 TSTGQC 737 



>gb|AAI66315.1| LOC100158602 protein [Xenopus (Silurana) tropicalis]
Length=883

GENE ID: 100158602 LOC100158602 | hypothetical protein LOC100158602
[Xenopus (Silurana) tropicalis] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 793 STGQCK 798 



>ref|XP_001912476.1| unnamed protein product [Podospora anserina]
emb|CAP59957.1| unnamed protein product [Podospora anserina]
Length=588

GENE ID: 6197588 PODANSg09524 | hypothetical protein
[Podospora anserina DSM 980]

Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 388 TSTGQC 393 



>ref|XP_001907985.1| unnamed protein product [Podospora anserina]
emb|CAP68658.1| unnamed protein product [Podospora anserina]
Length=111

GENE ID: 6192331 PODANSg5020 | hypothetical protein
[Podospora anserina DSM 980] 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/8 (75%), Positives = 6/8 (75%), Gaps = 0/8 (0%) 

Query 1 RTSTGQCK 8 

RTST CK 
Sbjct 60 RTSTSHCK 67 



>ref|XP_001770300.1| predicted protein [Physcomitrella patens subsp. patens]
gb|EDQ64975.1| predicted protein [Physcomitrella patens subsp. patens]
Length=559

GENE ID: 5933481 PHYPADRAFT_84461 | hypothetical protein
[Physcomitrella patens subsp. patens] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 304 STGQCK 309 

>ref|XP_001775575.1| p300/CBP acetyltransferase-related protein [Physcomitre
subsp. patens]
gb|EDQ59658.1| p300/CBP acetyltransferase-related protein [Physcomitrella pat
subsp. patens]
Length=1617

GENE ID: 5938778 HAC1501 | p300/CBP acetyltransferase-related protein
[Physcomitrella patens subsp. patens] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTG+CK 
Sbjct 1232 TSTGECK 1238 

>ref|XP_001785357.1| p300/CBP acetyltransferase-related protein [Physcomitre
subsp. patens]
gb|EDQ49842.1| p300/CBP acetyltransferase-related protein [Physcomitrella pat
subsp. patens]
Length=1082

GENE ID: 5948557 HAC1502 | p300/CBP acetyltransferase-related protein
[Physcomitrella patens subsp. patens] (10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 7/7 (100%), Gaps = 0/7 (0%) 

Query 2 TSTGQCK 8 

TSTG+CK 
Sbjct 697 TSTGECK 703 



>ref|ZP_02094454.1| hypothetical protein PEPMIC_01220 [Parvimonas micra ATCC 332
gb|EDP23416.1| hypothetical protein PEPMIC_01220 [Parvimonas micra ATCC 33270]
Length=324 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 93 STGQCK 98 
>ref|XP_963736.2| hypothetical protein NCU03244 [Neurospora crassa OR74A]

gb|EAA34500.2| conserved hypothetical protein [Neurospora crassa OR74A]

Length=531

GENE ID: 3879885 NCU03244 | hypothetical protein [Neurospora crassa OR74A]
(10 or fewer PubMed links) 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 343 TSTGQC 348 



>ref|XP_001502309.1| PREDICTED: fibrillin 1 [Equus caballus]
Length=2871

GENE ID: 100055741 FBN1 | fibrillin 1 [Equus caballus]
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 



>ref|ZP_01894398.1| hypothetical protein MDG893_13589 [Marinobacter algicola DG8

gb|EDM47477.1| hypothetical protein MDG893_13589 [Marinobacter algicola DG893]
Length=137 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 116 TSTGQC 121 



>gb|EDL80069.1| fibrillin 1, isoform CRA_a [Rattus norvegicus]

gb|EDL80070.1| fibrillin 1, isoform CRA_a [Rattus norvegicus]
Length=2872

GENE ID: 83727 Fbn1 | fibrillin 1 [Rattus norvegicus]
(10 or fewer PubMed links) 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1843 TSTGQC 1848 



>gb|EDL80071.1| fibrillin 1, isoform CRA_b [Rattus norvegicus]

Length=2807

GENE ID: 83727 Fbn1 | fibrillin 1 [Rattus norvegicus]
(10 or fewer PubMed links) 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1778 TSTGQC 1783 



>gb|EDL28139.1| fibrillin 1, isoform CRA_a [Mus musculus]

gb|EDL28140.1| fibrillin 1, isoform CRA_b [Mus musculus]
Length=2873

GENE ID: 14118 Fbn1 | fibrillin 1 [Mus musculus] (Over 100 PubMed links)
Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1844 TSTGQC 1849 



>ref|XP_001393188.1| hypothetical protein An08g10840 [Aspergillus niger]

emb|CAK45757.1| hypothetical protein [Aspergillus niger]
Length=329

GENE ID: 4983398 An08g10840 | hypothetical protein
[Aspergillus niger CBS 513.88] (10 or fewer PubMed links) 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 292 TSTGQC 297 



>ref|XP_001369848.1| PREDICTED: hypothetical protein [Monodelphis domestica]
Length=2871

GENE ID: 100025769 LOC100025769 | hypothetical protein LOC100025769
[Monodelphis domestica] 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1843 TSTGQC 1848 



>gb|EAY76601.1| hypothetical protein OsI_04550 [Oryza sativa Indica Group]
Length=895 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 667 STGQCK 672 



>ref|XP_001443583.1| hypothetical protein [Paramecium tetraurelia strain d4-

emb|CAK76186.1| unnamed protein product [Paramecium tetraurelia]
Length=2350

GENE ID: 5029368 GSPATT00011759001 | hypothetical protein
[Paramecium tetraurelia strain d4-2] 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 2043 TSTGQC 2048 



>ref|ZP_01693530.1| conserved protein [Microscilla marina ATCC 23134]

gb|EAY25518.1| conserved protein [Microscilla marina ATCC 23134]
Length=193 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 98 TSTGQC 103 
>gb|EAW77353.1| fibrillin 1 (Marfan syndrome), isoform CRA_a [Homo sapiens]
Length=2869

GENE ID: 2200 FBN1 | fibrillin 1 [Homo sapiens] (Over 100 PubMed links)
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 



>ref|NP_000129.3| fibrillin 1 precursor [Homo sapiens]

sp|P35555.2|FBN1_HUMAN RecName: Full=Fibrillin-1; Flags: Precursor

gb|EAW77354.1| fibrillin 1 (Marfan syndrome), isoform CRA_b [Homo sapiens]

gb|AAI46855.1| Fibrillin 1 [Homo sapiens]
gb|ACZ58372.1| fibrillin 1 [Homo sapiens]
Length=2871

GENE ID: 2200 FBN1 | fibrillin 1 [Homo sapiens] (Over 100 PubMed links)
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 



>ref|NP_032019.2| fibrillin 1 [Mus musculus]
emb|CAM22806.1| fibrillin 1 [Mus musculus]
emb|CAM17722.1| fibrillin 1 [Mus musculus]

Length=2873

GENE ID: 14118 Fbn1 | fibrillin 1 [Mus musculus]

Score = 21.8 bits (44), Expect = 7434 
Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%)

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1844 TSTGQC 1849 



>ref|YP_856076.1| drug/metabolite exporter family transporter [Aeromonas hydro
subsp. hydrophila ATCC 7966]

gb|ABK39216.1| transporter, 10 TMS drug/metabolite exporter (DME) family [Aer
hydrophila subsp. hydrophila ATCC 7966]
Length=293

GENE ID: 4487228 AHA_1538 | drug/metabolite exporter family transporter
[Aeromonas hydrophila subsp. hydrophila ATCC 7966] (10 or fewer PubMed links) 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

R STGQC 
Sbjct 284 RASTGQC 290 



>ref|XP_001149266.1| PREDICTED: fibrillin 1 [Pan troglodytes]
Length=3004

GENE ID: 453411 FBN1 | fibrillin 1 [Pan troglodytes] (Over 100 PubMed links)
Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%)

Query 2 TSTGQC 7

TSTGQC

Sbjct 1975 TSTGQC 1980



>ref|NP_001044899.1| Os01g0865600 [Oryza sativa (japonica cultivar-group)]
Length=921

GENE ID: 4324795 Os01g0865600 | Os01g0865600 [Oryza sativa Japonica Group]
(10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 693 STGQCK 698 



>ref|XP_397600.3| PREDICTED: similar to family with sequence similarity 48,

A isoform b [Apis mellifera]

Length=1078

GENE ID: 410031 LOC410031 | similar to family with sequence similarity 48,
member A isoform b [Apis mellifera] 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

R STGQC 
Sbjct 1005 RASTGQC 1011 



>ref|XP_392100.3| PREDICTED: similar to CG16791-PA [Apis mellifera]
Length=452

GENE ID: 408555 LOC408555 | similar to CG16791-PA [Apis mellifera]
(10 or fewer PubMed links)

Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 419 TSTGQC 424 



>ref|XP_001100608.1| PREDICTED: estrogen-related receptor beta isoform 2 [Ma
Length=454

GENE ID: 703799 ESRRB | estrogen-related receptor beta [Macaca mulatta]
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 161 TSTGQC 166 



>ref|XP_001113107.1| PREDICTED: similar to fibrillin 1 precursor [Macaca mul
Length=2871

GENE ID: 714451 LOC714451 | similar to fibrillin 1 precursor [Macaca mulatta]
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 



>ref|ZP_01306957.1| hypothetical protein RED65_16411 [Oceanobacter sp. RED65]

gb|EAT12438.1| hypothetical protein RED65_16411 [Oceanobacter sp. RED65]
Length=135
Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 109 TSTGQC 114 

>ref|XP_861706.1| PREDICTED: similar to Fibrillin-1 precursor isoform 3 [Can

familiaris]

Length=2417

GENE ID: 478293 FBN1 | fibrillin 1 [Canis lupus familiaris]
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 

>ref|XP_535468.2| PREDICTED: similar to Fibrillin-1 precursor isoform 1 [Can

familiaris]

Length=2871

GENE ID: 478293 FBN1 | fibrillin 1 [Canis lupus familiaris]
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 

>dbj|BAD92077.1| fibrillin 1 variant [Homo sapiens]
Length=830

GENE ID: 2200 FBN1 | fibrillin 1 [Homo sapiens] (Over 100 PubMed links)
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 277 TSTGQC 282 

>dbj|BAD81907.1| bHLH transcription factor-like protein [Oryza sativa Japonica
Group]

dbj|BAG90720.1| unnamed protein product [Oryza sativa Japonica Group]
dbj|BAF06813.2| Os01g0865600 [Oryza sativa Japonica Group]
Length=904 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 676 STGQCK 681 

>gb|AAT94187.1| PilL [Escherichia coli]
Length=356 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTST QC 
Sbjct 340 RTSTAQC 346 

>ref|YP_308679.1| lipoprotein [Escherichia coli]
gb|AAL18827.2| Lipoprotein [Escherichia coli]
Length=356

GENE ID: 3654428 pilL | lipoprotein [Escherichia coli]
(10 or fewer PubMed links) 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTST QC 
Sbjct 340 RTSTAQC 346 



>gb|AAL05516.1|AF399919_3 PilL [Escherichia coli] 
Length=357 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/7 (85%), Positives = 6/7 (85%), Gaps = 0/7 (0%) 

Query 1 RTSTGQC 7 

RTST QC 
Sbjct 341 RTSTAQC 347 



>emb|CAD19195.1| novel protein similar to MHC class II beta chain [Danio rerio
Length=249

GENE ID: 368615 si:busm1-194e12.12 | si:busm1-194e12.12 [Danio rerio]
(10 or fewer PubMed links) 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 244 STGQCK 249 



>ref|XP_001030843.1| hypothetical protein TTHERM_01006510 [Tetrahymena thermop

gb|EAR83180.1| hypothetical protein TTHERM_01006510 [Tetrahymena thermophila
SB210]
Length=1588

GENE ID: 7842955 TTHERM_01006510 | hypothetical protein
[Tetrahymena thermophila] 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 287 STGQCK 292 



>gb|AAC62317.1| mutant fibrillin-1 [Mus musculus]
Length=3857

GENE ID: 14118 Fbn1 | fibrillin 1 [Mus musculus] (Over 100 PubMed links)
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 2828 TSTGQC 2833 



>gb|AAA64217.1| fibrillin-1 [Mus musculus]

Length=2873

GENE ID: 14118 Fbn1 | fibrillin 1 [Mus musculus] (Over 100 PubMed links)
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 
Query 2 TSTGQC 7 






>ref|XP_861733.1| PREDICTED: similar to Fibrillin-1 precursor isoform 4 [Canis

familiaris]

Length=2816

GENE ID: 478293 FBN1 | fibrillin 1 [Canis lupus familiaris]
Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 

>ref|XP_861765.1| PREDICTED: similar to Fibrillin-1 precursor isoform 5 [Canis

familiaris]

Length=2804

GENE ID: 478293 FBN1 | fibrillin 1 [Canis lupus familiaris]
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1787 TSTGQC 1792 

>ref|XP_861792.1| PREDICTED: similar to Fibrillin-1 precursor isoform 6 [Canis

familiaris]

Length=2872

GENE ID: 478293 FBN1 | fibrillin 1 [Canis lupus familiaris]
Score = 21.8 bits (44), Expect = 7434

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1843 TSTGQC 1848 

>ref|NP_114013.1| fibrillin 1 [Rattus norvegicus]

gb|AAD34438.1| fibrillin-1 [Rattus norvegicus]
Length=2872 



Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 



Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1843 TSTGQC 1848 

>dbj|BAD16739.1| fibrillin 1 [Homo sapiens]
Length=2871

GENE ID: 2200 FBN1 | fibrillin 1 [Homo sapiens] (Over 100 PubMed links)
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 

>ref|XP_002633929.1| C. briggsae CBR-LAM-1 protein [Caenorhabditis briggsae]
emb|CAP37137.1| C. briggsae CBR-LAM-1 protein [Caenorhabditis briggsae]
Length=1265

GENE ID: 8575925 Cbr-lam-1 | C. briggsae CBR-LAM-1 protein
[Caenorhabditis briggsae] 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 3 STGQCK 8 

STGQCK 
Sbjct 587 STGQCK 592 

>ref|NP_001001771.1| fibrillin 1 precursor [Sus scrofa]

sp|Q9TV36.1|FBN1_PIG RecName: Full=Fibrillin-1; Flags: Precursor

gb|AAD50328.1|AF073800_1 fibrillin-1 precursor [Sus scrofa]
Length=2871

GENE ID: 414836 FBN1 | fibrillin 1 [Sus scrofa] (10 or fewer PubMed links)
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 

>prf||1713408A fibrillin
Length=1973 

Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 
Sbjct 944 TSTGQC 949 

>sp|Q61554.1|FBN1_MOUSE RecName: Full=Fibrillin-1; Flags: Precursor

gb|AAA56840.1| fibrillin [Mus musculus]
Length=2871

GENE ID: 14118 Fbn1 | fibrillin 1 [Mus musculus] (Over 100 PubMed links)
Score = 21.8 bits (44), Expect = 7434 

Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) 

Query 2 TSTGQC 7 

TSTGQC 

Sbjct 1842 TSTGQC 1847 